NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Subset Node Anomaly Tracking over Large Dynamic Graphs

https://doi.org/10.1145/3534678.3539389

Guo, Xingzhi; Zhou, Baojian; Skiena, Steven (August 2022, Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Subset Node Representation Learning over Large Dynamic Graphs

https://doi.org/10.1145/3447548.3467393

Guo, Xingzhi; Zhou, Baojian; Skiena, Steven (August 2021, Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Full Text Available
Dual Averaging Method for Online Graph-structured Sparsity

https://doi.org/10.1145/3292500.3330915

Zhou, Baojian; Chen, Feng; Ying, Yiming (August 2019, SIGKDD Conference on Knowledge Discovery and Data Mining (KDD))

Full Text Available
Detecting Media Self-Censorship without Explicit Training Data

https://doi.org/10.1137/1.9781611976236.62

Tao, Rongrong; Zhou, Baojian; Chen, Feng; Mares, Dvid; Butler, Patrick; Ramakrishnan, Naren; Kennedy, Ryan (July 2020, Proceedings of the 2020 SIAM International Conference on Data Mining)
Demeniconi; Carlotta; Nitesh V. Chawla (Ed.)
The motives and means of explicit state censorship have been well studied, both quantitatively and qualitatively. Self-censorship by media outlets, however, has not received nearly as much attention, mostly because it is difficult to systematically detect. We develop a novel approach to identify news media self-censorship by using social media as a sensor. We develop a hypothesis testing framework to identify and evaluate censored clusters of keywords and a near-linear-time algorithm (called GraphDPD) to identify the highest-scoring clusters as indicators of censorship. We evaluate the accuracy of our framework, versus other state-of-the-art algorithms, using both semi-synthetic and real-world data from Mexico and Venezuela during Year 2014. These tests demonstrate the capacity of our framework to identify self-censorship and provide an indicator of broader media freedom. The results of this study lay the foundation for detection, study, and policy-response to self-censorship.
more » « less
Full Text Available
Dual Averaging Method for Online Graph-structured Sparsity

https://doi.org/10.1145/3292500.3330915

Zhou, Baojian; Chen, Feng; Ying, Yiming (July 2019, Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining)

Online learning algorithms update models via one sample per iteration, thus efficient to process large-scale datasets and useful to detect malicious events for social benefits, such as disease outbreak and traffic congestion on the fly. However, existing algorithms for graph-structured models focused on the offline setting and the least square loss, incapable for online setting, while methods designed for online setting cannot be directly applied to the problem of complex (usually non-convex) graph-structured sparsity model. To address these limitations, in this paper we propose a new algorithm for graph-structured sparsity constraint problems under online setting, which we call GraphDA. The key part in GraphDA is to project both averaging gradient (in dual space) and primal variables (in primal space) onto lower dimensional subspaces, thus capturing the graph-structured sparsity effectively. Furthermore, the objective functions assumed here are generally convex so as to handle different losses for online learning settings. To the best of our knowledge, GraphDA is the first online learning algorithm for graph-structure constrained optimization problems. To validate our method, we conduct extensive experiments on both benchmark graph and real-world graph datasets. Our experiment results show that, compared to other baseline methods, GraphDA not only improves classification performance, but also successfully captures graph-structured features more effectively, hence stronger interpretability.
more » « less
Full Text Available
Stochastic Iterative Hard Thresholding for Graph-structured Sparsity Optimization

Zhou, Baojian; Chen, Feng; Ying, Yiming (June 2019, Proceedings of the 36th International Conference on Machine Learning, Long Beach, California, PMLR 97)

Full Text Available
Stochastic Iterative Hard Thresholding for Graph-structured Sparsity Optimization

Zhou, Baojian; Chen, Feng; Ying, Yiming (June 2019, Proceedings of the 36th International Conference on Machine Learning)

Stochastic optimization algorithms update models with cheap per-iteration costs sequentially, which makes them amenable for large-scale data analysis. Such algorithms have been widely studied for structured sparse models where the sparsity information is very specific, e.g., convex sparsity-inducing norms or ℓ0-norm. However, these norms cannot be directly applied to the problem of complex (non-convex) graph-structured sparsity models, which have important application in disease outbreak and social networks, etc. In this paper, we propose a stochastic gradient-based method for solving graph-structured sparsity constraint problems, not restricted to the least square loss. We prove that our algorithm enjoys a linear convergence up to a constant error, which is competitive with the counterparts in the batch learning setting. We conduct extensive experiments to show the efficiency and effectiveness of the proposed algorithms.
more » « less
Full Text Available

Search for: All records